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<160> 7 

<170> PERL Program 

<210> 1 

<211> 1301 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 238040 

<400> 1 

agagccacag ataccctacc cggagcccac ctggccaccc ccgctcagtg cccccagggt 60 
cccctaccac tcctcagtgc tctccgtcac ccggcctgtg gtggtctctg ccacgcatcc 120 
cacactgcct tctgcccacc agcctcctgt gatccctgcc acacacccag ctttgtcccg 180 
tgaccaccag atccccgtga tcgcagccaa ctatccagat ctgccttctg cctaccaacc 240 
cggtattctc tctgtctctc attcagcaca gcctcctgcc caccagcccc ctatgatctc 300 
aaccaaatat ccggagctct tccctgccca ccagtccccc atgtttccag acacccgggt 360 
cgctggcacc cagaccacca ctcatttgcc tggaatccca cctaaccatg cccctctggt 420 
caccaccctc ggtgcccagc taccccctca agccccagat gcccttgtcc tcagaaccca 480 
ggccacccag cttcccatta tcccaactgc ccagccctct ctgaccacca cctccaggtc 540 
ccctgtgtct cctgcccatc aaatctctgt gcctgctgcc acccagcccg cagccctccc 600 
caccctcctg ccctctcaga gccccactaa ccagacctca cccatcagcc ctacacatcc 660 
ccattccaaa gccccccaaa tcccaaggga agatggcccc agtcccaagt tggccctgtg 72 0 
gctgccctca ccagctccca cagcagcccc aacagccctg ggggaggctg gtcttgccga 780 
gcacagccag agggatgacc ggtggctgct ggtggcactc ctggtgccaa cgtgtgtctt 840 
tttggtggtc ctgcttgcac tgggcatcgt gtactgcacc cgctgtggcc cccatgcacc 900 
caacaagcgc atcactgact gctatcgctg ggtcatccat gctgggagca agagcccaac 960 
agaacccatg ccccccaggg gcagcctcac aggggtgcag acctgcagaa ccagcgtgtg 1020 
atggggtgca gacccccctc atggagtatg gggcgctgga cacatggccg gggctgcacc 1080 
agggacccat gggggctgcc cagctggaca gatggcttcc tgctccccag gcccagccag 1140 
ggtcctctct caaccactag acttggctct caggaactct gcttcctggc ccagcgctcg 1200 
tgaccaagga tacaccaaag cccttaagac ctcagggggc gggtgctggg gtcttctcca 1260 
ataaatgggg tgtcaaccgt ttaaaaaaaa aaaaaagggc g 1301 

<210> 2 

<211> 5679 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 411448 

<220> 

<221> unsure 

<222> 1657-1658, 1663, 1667, 1672, 1675 
<223> a, t, c, g, or other 

<400> 2 

agcccccaac cccccgaggc cactcacctc ccccaactac ccaggacaga ggatgcccag 60 
ccagccgagc tccgggcagt acccgccccc cacggtcaac atggggcagt attacaagcc 120 
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agaacagttt aatggacaaa ataacacgtt 
aagggaatgt caacaggcct cccaggccgg 
ttccagggaa ccccacaccc cccatgaccc 
ccagccaaga cgtcaaacca cccttcccgc 
caccaccccc agccaaccac aatgacgagc 
tggtgctgga gcccttccgc ctggagcaca 
tgcggcccac ggtccaccag acgctgatgt 
gctaccacca cgaggaccgg cagatgaaca 
tgaacgccac gcccctcacc attgagcgcg 
acctgaagca cgtgtgccag ccgggccgca 
gctgctccca cctcttcgtg ctgcagctgg 
aaggactcct caagaagcgc ctcctgcccg 
atttcagcag cgtggctgcc tcctcgggca 
agcagacggc catcaaggtg tctctgaagt 
ctgctcgagg acacgattgc aagcatgtgc 
tgaattgcga gagagggacc tggaggtgtc 
gcctggaggt ggatcagtac atgtggggaa 
aagaggtcac catcgatccc acgtgcagct 
acatcaagga tgaccctgat ggcatcccct 
agatgatcat gcccaatgtc atggagatga 
atcccctccc gcctccccca gggggcacca 
actaccaagg ccatggcaac tttgacttcc 
atgacttcat gcacgggccc ccccagctct 
ccgccctcga gaaacccctc agccacccca 
ctgaccagcc ccacccctcc atacaacaag 
cagggcctcc attacatcac agtggggctc 
agccgccaca ggccgctccc agcagccatc 
cagccttaga gggtcaggcc ggagcgcagg 
atctccttcc cgaactcaca aatcctgacg 
tgccgagcaa tagtaacgat gacctcctgt 
gtcgggccca tccctccaca ctctgcatcc 
tgggagcctg tgccctcaga ccgccccgca 
ctcccccgct gcagccctct cagaacagag 
aggctgtgtg ggtctggagc ccacgtccca 
agcgcaggcc tgaagaccac cctcccgaga 
atgcggcttc ccggtccctc cgcgtgtgcc 
gttcttccat cttctagact gtaaccctgc 
cagtgactgt ggagcctgag aaggcccccg 
agcactggca gttggtggca gtgagaccag 
acaaacctct gggaaagaca acgtctctcg 
cctataagcc aagatacccc ataaacacac 
gtgtttgaga gggggtctgc cattcctgct 
cttctgagcc agacgtccct gaggtccacc 
acagagagag caggtgaaca cccaaccaaa 
catataacct gtcagaagca aacaggagcg 
ctttttaaac agcaacaatt taaagctatg 
tggacagcaa gcaaaccatt tctctccgtc 
ccacctctcc aagacttccg tgggacaccc 
atcagatggc aagggcagtg cgtggaaagg 
caatggtgtc tgtccagccc ctccctctgt 
agcccaggcc atggacatgt gcaccagtat 
tgtttctggg cctgccccag actactgccc 
tccaccatca caatctcacc caaactcctg 
ccctccaccg ctttgctcca tctggcttac 
ctgtgttcac tttgtttcag gctggtctgt 
ccaggttgtc ccgtcactgg ggtcccatct 
cctggggccc tttcctgctc tcccgtccgc 
ttttaccgga aaggtggccc cagctgttga 
ttttctgtag gaaagctgcc attgccccgg 
gttttttcaa atagcgtgtt gttcagtatg 
aaatatcttt gtgaatattt tagtatcgtc 
gttatagcct ttgctggtgt ttttaaaata 
tttgtgttct ttaaacaaaa acaaaaaaag 
tgaacagaat gggcggctgt gctgggagtt 
atccctcagt cagctggcag ggacaagcca 
agtgtctgtg gcctggcaag tggccacgcc 



ctcgggaagc agctacaagt aactacagcc 180 
ttcctgtggc aaattacccc cactcacctg 240 
ctgggagcag catccctcca tacctgtccc 300 
ctgacatcaa gccaaatatg agcgctctgc 360 
tgcggctcac attccctgtg cgggatggcg 420 
acctggcggt cagcaaccat gtgttccacc 480 
ggaggtctga cctggagctg cagttcaagt 540 
ccaactggcc cgcctcggtg caggtcagcg 600 
gcgacaacaa gacctcccac aagcccctgc 660 
acaccatcca gatcaccgtc acggcctgct 720 
tacaccggcc ctccgtccgc tctgtgctgc 780 
cagagcactg tatcacgaaa atcaagcgga 840 
acacgaccct caacggggag gatggggtgg 900 
gccccatcac attccggcgc atccagctgc 960 
agtgctttga tctggagtca tacctgcagc 1020 
ctgtgtgcaa taaaaccgct ctgctggagg 1080 
tcctgaatgc catccaacac tccgagtttg 1140 
ggcggccggt gcccatcaag tcggacttac 1200 
ccaagcggtt caagaccatg agtcccagcc 1260 
tcgcagccct gggccccggc ccgtccccct 1320 
actccaacga ctacagcagc caaggcaaca 1380 
cccacgggaa ccctggaggg acatccatga 1440 
cccacccccc ggacatgccc aacaacatgg 1500 
tgcaggaaac tatgccacac gctggcagct 1560 
gtttgcacgt accacacccc agcagccagt 1620 
ctcctcnnca tcnttcncag cntcnccggc 1680 
cacacagcga cctgaccttt aacccctcct 1740 
gagcgtccga catgccggag ccttcgctgg 1800 
agctcctgtc ttatctggac ccccccgacc 1860 
ctctatttga gaacaactga gggccacccg 1920 
taccccacct acccaacaca cttttccacc 1980 
ccagagccac gggctgtggg gcggggagcc 2040 
gggtagggag ggtgcaccag tgcaccagga 2100 
cctccacacc cttggcttgg gcccatgccc 2160 
ggaaccagcc cggtaagagg gcacacgctg 2220 
gattccaaat gaccttccag tgtccccaag 2280 
ctccctgctt cctggtccag agcctccctc 2340 
ggccccagca tgggccccga gccttggagg 2400 
gccacccacc accacccacc acagaaaagc 2460 
ggggccaggg gtcatcggtt tgacccctga 2520 
tcagaaagca gagaaaaagg acaagagtct 25 80 
tggggactgg tggggaagag ggccagacat 2640 
tccaagctca gacaggccca ggctttggga 2700 
gtgattgtgc ccttggttgg ggggcgcggg 2760 
gcaacttcta actttgctcc aagccactct 2820 
aagtcacctg gagaaaagga acgttgctct 2880 
tgttgtgttt ttctcctagt acctcacctg 2940 
acttccctct gtcctagttc tctttgtcca 3000 
ccggggaggt gcagaaacca gagcccaggg 3060 
ccctgtgctc caagcctgtc ccccggctgc 3120 
gtacctgcag gcatcagggg gaggggggcg 3180 
ttggctgcca gcctaccctg cctgcactcc 3240 
ctcactcaag caaaagcagc ctctggcctt 3300 
cactctccag ggcctcctgg ggagcctgtc 3360 
gccccgtgag ccacatggcc tagggtgatg 3420 
gtaaattctt tgcgcccttc ccggctgctg 3480 
tgtgggtggt ccccagcact cctctgtggg 3540 
cttccagtca ctgtcccaga cggcacaagg 3 600 
ccccttttct tcctttgtcc cgttgtcgag 3660 
caaatcaatt attttaagaa tcgcttttgt 3720 
tttgataata ttcaacattt tcatgacctg 3 780 
cctggactca atgacaaaga ccgagtcttc 3 840 
caaccagggc tatttgtaca gttgaagggg 3900 
ggaagaccgg gcagcccgct atttagagcc 3960 
acgccaggta gcatgtggcc acccttgccc 4020 
ctgtgtcaga ccatctggga attaagctcc 4080 
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agacagactt acagatgcct tccttaggag ttcttgcttc ttgcgttgat actttgcccc 4140 
agaaaggcct gggattcatt ctggttctta tcagggtgtg tccacactct gctcacaggt 4200 
ggatccacgg ctttccagtg cagagagtcg agatgctccc tgcagcccag gccccgggca 4260 
cctcctgcaa ccatctctgg gctcagcacc tgaggcgggt ttcctgggtc ccctctccag 4320 
caagcctcca ccagcaagct cggcccagag cttcccttcc ggctggctct gaaccgtgcg 4380 
tggtgcctac agcctgcagt ctggagacaa gctcttccgg agtgctctgg gagccaggcc 4440 
agggtgtgag ggaggtgcag aggcatccgg ggcgggagca agccccaggt tgtgacaggt 4500 
gcaggtagac aacgcccata aacagagatg gtcctgaact ctggagagat ccttccctga 4560 
tcctttcgga cgactacttg gagccataag taacctcagc aaaaacgagg cctctgcaag 4620 
ccacttttcc atgccaagca tccacccggc ccacaggcat gtttctgccg ccactccgca 4680 
agatggacag ggagccagca ggcaggcggg aagggccaag tacaggcaat cacccccatc 4740 
ttcttggttt gaagctttat ccatgtatca tgttccgtgt agccatttta ttttttaaga 4800 
aactgctaat actttctccc taatggaagc cctgatcccc cagagagcta caggtctgct 4860 
cccgacgggc ctcgggcctg acccgtccac acagggccgt gtcaacagca gcgactcaag 492 0 
ggacgtgtgt acatatgtaa atgagaaata gagacgtgtc aacagatgca ttcatttctc 4980 
ttggaatgtg tattgttttt attttgcgaa acaaaacaaa acaaaaaaaa aagcttggaa 5040 
ctccatcacg tggaaaaact agatcctgtt ggttatagca tttgtgagtt ctccacgtct 5100 
gtctctctcg ctcatgtaat atactctgac cctgagtgga aaggggtttt tgttctgttt 5160 
ttattttacc tacatgtact atttagcttc agtgtactag tcctgccacc tgtgtatttt 5220 
tagggtgcta tggaaataat gaaaagaaac ggggatttca gaagaaaatt gtaaccaaat 5280 
tcatactttg tataattttt gatatcatga tcacaggtga ttcacacgta cacacataaa 5340 
cacacccacc agtgcagcct gaagtaactc ccacagaaac catcatcgtc tttgtacatc 5400 
gtatgtacaa tgcaatcatt tcatacttta aactggtcaa aaaactaatt gtgatttcta 5460 
u gtcttgcaaa gctgtatgta gttagatgat gtgacaacct ctaatattta tctaataaat 552 0 
<2 atgtattcag atgaaacctg tatattaggt gttcatgtgg ttattttgta tttaaagatc 5580 
U aaattatttg actattgcta gacatttcta tactctgttg taacactgag gtatctcatt 5640 
|_j tgcccatgtt aatttttttc taaataaatt gacaaaaac 5679 

't\ <210> 3 

S <211> 3513 

H 0 <212> DNA 

<213> Homo sapiens 

a ~ <220> 

£3 <221> misc_f eature 

rr <223> Incyte ID No: 454163 

'tl <400> 3 

85 cggggaggaa ggagtgtgca gagtgtcacc attcaggtgt cctgggaaag atgaaagcac 60 
U ccccgccaag aagacagaca ttccctggcg gctgaagcag atgctggata tcctggtgta 120 
ry tgaagagcag cagcaggcgg ccgcggggtg aggcagggcc ctgcctggag tacctgctgc 180 
agcacaagat cctggagact ctctgcacgc tgggcaaggc cgagtacccc ccaggcatgc 240 
ggcagcaggt gttccagttc ttcagcaagg ttctggcgca ggtgcagcac cccctgctgc 300 
attacctcag cgtccacagg cctgtgcaga aactcctccg acttggtggg actgcttccg 360 
gatccgttac agaaaaggag gaggtgcagt tcaccaccgt cctctgctcc aagatccagc 420 
aggacccaga gctgctcgcc tacatcctgg aaggtaaaaa gattgtaggt aggaagaaag 480 
catgcggaga acccactgcc ctgcctaagg acacaaccag ccacggggac aaggactgct 540 
cccacgatgg tgctcctgcc aggccccagc tggacgggga gtcctgtggg gcccaggcct 600 
tgaacagcca catgcctgct gagaccgagg agctggacgg tgggaccaca gagagcaacc 660 
tgattacctc cctgcttggg ctgtgccaga gcaagaagag tcgggtggcc ttgaaggccc 720 
aggagaacct gctgctcctg gtgagcatgg cctccccagc agctgccacc tacctggtac 780 
agagcagcgc ctgctgccct gcgatcgtcc ggcacctttg ccagttgtac cggtccatgc 840 
ctgtcttcct ggaccccgca gacattgcca ccttagaggg catcagctgg aggttaccca 900 
gtgccccgtc tgatgaggct tccttccctg gcaaggaggc cttggctgcc ttcttgggct 960 
ggtttgatta ctgcgaccac ctcatcacag aggcacacac ggtggttgcg gacgccttgg 1020 
cgaaggctgt ggctgagaac ttcttcgtgg agaccctgca gccccagctc ctgcacgtgt 1080 
ccgagcagag catcttgacc tccaccgccc tcctcacagc catgctgcgc cagcttcgct 1140 
cccctgcgct gctgcgggag gccgtggctt tcctcctggg cacagaccgg cagcctgaag 1200 
cccccgggga caacccccac accctgtatg ctcatctcat cgggcattgt gaccacctct 1260 
ctgatgagat cagcatcacc acactccggc tgtttgagga gctgctgcag aagccccacg 1320 
aggggatcat ccacagcctg gtcctgcgca accttgaggg ccgcccttac gtggcctggg 1380 
gctcaccaga gcctgagagc tatgaggaca ccctagacct ggaggaagac ccctacttca 1440 
, ccgacagctt cctggattcc ggctttcaaa ctcccgcaaa gcctcgccta gctcctgcta 1500 
ccagttacga tggcaaaaca gcagtgaccg agatcgtcaa caggagtgca gctcccgcgt 1560 
cgcctcctgg ggctggcctc tgacccccac acctttggac ccccatgagc ccgagcgacc 1620 
tttcttcgag ggccacttcc tccgagtgct gtttgaccgc atgtcccgga ttctggatca 1680 
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gccatacagc ctgaacctgc aggtgacctc ggtcctgtcc cggcttgccc tcttccccca 1740 
cccccatatt catgagtacc tgctggatcc gtacatcagc ctggcccccg gctgcaggag 1800 
cctattctcc gtgttggtga gggtgatcgg ggacttgatg cagagaatcc agagggtacc 1860 
ccagttccca ggcaagctgc tcctggtgcg caacagttga cgggccaggc tcctggggag 1920 
cagctggacc accagaccct cctccagggc gtggtggtgc tggaggagtt ctgcaaggag 1980 
ctggctgcca ttgccttcgt caagtttccc ccacatgatc ctcgccagaa cgtctcccca 2 040 
gccccggaag ggcaggtctg agccagcacc agggcggtgg gagactcctg tccacacctc 2100 
tgccccagag ctgcctcctg cctggcactg ccgccacact cccctcctgg gatggggctt 2160 
ctgctcccgg gctcactcaa ggagactgcg gcatgttgac cacaccagac tgggtttcag 2220 
ggaatgggca tgccaggtgc caaggagcca aacagatggc tttccaggca gcaaggtcct 2280 
tggggccttc ttggaggagc ttgggtgaca gccaggtgag cacccagacc ccagaccctc 2340 
atgtgctgtg tgcctggccc cttctgtact ggccatttgt ggccagggcc aagcctgtga 2400 
ctcaactcca ggggcaagat ggggagtgag ctgatggctc cgagactggt caggagccca 2460 
ggccagtgag atggggcctg gagccttgtc tgtgtcacat taggtaccat gggagctgct 2520 
gagacctgac attttgtccc ctgcctacat ggcttggccc atggagaagg agcagtgaat 2580 
gggatcgtcg gggaagcccc tcttcctgct ctgctcccct ggaaactgtt gcaaaactcc 2640 
cagccgcctc atggcaaatg cccaaagcat gttccgcacc caggcggggg cccctgctaa 2700 
tgagaacctt ggtgcagctg cagccaggag gggagcgggc ccaggagcca ggctcaggtc 2760 
cagctggttc ctctctggcg ccttctgaac ccgtctcagc aggtccacag cacctgggca 2820 
gaggtcagag accaggggag gccgggcctt gccctccctt ctgcccaggg cccagtgttc 2880 
ttgatagaag acccttctgg ggagccaggg agctcagggg acagataagg gaaggacgcc 2940 
ccctgactcc aggcccctga gcctggcggg aagtggctgc ggcccaggca gccagtcctg 3000 
gtggtgttct ccctgcatgc cctccgtggc tgggctgcca ccccacccgg cccgaatctg 3060 
tcttgacctg caggaataca cgggcggcgc caggcattac ctcacagcgg gactacacag 3120 
ttgctggctt tgctcctggg caaggaggag caggccagag cctcttttgc ttccttttct 3180 
tgcccatgcc gcttctagaa gccaggcaca ggttgccaag aggtgacacg aaacaggagg 3240 
aaactcagtg acctctgcct ctcccacatt cctccccgcg ggggaggacc tcgccgctct 3300 
gaagagcacc gtgcacatgt gggtgcacaa acgtgggtgt tggtgtggac ggggcgcaga 3360 
tctccgtgga tgaactgcgt ctggactctt agattcataa aatattcgag ggtttgggag 3420 
tcacagaccc tcccctctcc tcagtgcact ttggcatttg cacggtgtct tccccggaca 3480 
gcacagcaat aaatggtgtg attgcgtgga aaa 3513 

<210> 4 

<211> 2970 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 988966.17 

<400> 4 

gtgggtacaa gatgacggag ccgggcgcct ctcccgagga cccttgggtc aaggcaagcc 60 
ccgtgggcgc gcacgccggc gaggggaggg cgggtcgggc tcgtgcacgt aggggggccg 12 0 
gaagacgagg ggcttccctc ctgtccccaa agtcccccac gctctccgtg ccccggggct 180 
gcagagaaga cagctctcac cccgcgtgtg ccaaggtgga gtatgcctac agcgacaaca 240 
gcctggaccc cgatgatgag gacagtgatt accaccagga ggcctacaag gagtcctaca 300 
aagaccggcg gcggcgcgca cacactcagg ctgagcagaa gaggagggac gccatcaaga 360 
gaggctatga tgaccttcag accatcgtcc ccacttgcca gcagcaggac ttctccattg 420 
gctcccaaaa gctcagcaaa gccatcgttc tacaaaagac cattgactac attcagtttt 480 
tgcacaagga gaagaaaaag caggaggagg aggtgtccac gttacgcaag gatgtcaccg 540 
ccctaaagat catgaaagtg aactatgagc agattgtgaa ggcacaccag gacaaccccc 600 
atgaagggga ggaccaggtc tctgaccagg tcaagttcaa cgtgtttcaa ggcatcatgg 660 
attccctgtt ccagtccttc aatgcctcca tctcagtggc cagcttccag gagctgtcag 720 
catgtgtctt cagctggatc gaggagcact gtaagcctca gaccctgcgg gagattgtga 780 
ttggcgtcct gcaccaattg aaaaaccagc tttactgacc ggttcttgga aacctggaga 840 
acagccaaca agaggccctt gaatctctac gtggccactg aactgctggg cccgggagac 900 
tggactacaa cacctcacac tggtcagctg gtttctactt ggtgtttggt ttttcccagc 960 
cccattttat cttcagcgga gccgcggtgt ttgttttgtg aaagcttctg attaatttat 1020 
tatattgacg ataaaactca aacctaccca gccttccccc cactccatgg aagtccttgg 1080 
gatgggcgtc tgctctggac accccaaaga gctcctgccc tctcagccct ttattcaagc 1140 
ctcagatttc tgctcatgat ctacatagat ttggaaactg ttttcctctg ttttggtctc 12 00 
ttgggcaaca tttttggccc aagtttgggc aacatttggc ccaagtttgg gcattttggc 1260 
agtagctgta tgggagaaaa agagtaagag gaaatattcc cacagccatg aagggtgaaa 1320 
gggcaccttg tgcctagact agggctgcct ggtcagtccc aggtgaggcc aagggctttc 13 80 
tggccatctc agggaggggc caccaggttc ctcccctcac cccatattcc atcaccttcc 1440 
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tcctctgctc 
gcagaggcag 
tgcagttttg 
gtcaagcata 
ggggtggggt 
gcaggtcctg 
ccctagttct 
tccagaggaa 
ttcctttagc 
aaatgggctg 
tatgcttgtc 
caaccaaaaa 
accagtcctg 
tctctatccc 
catcagacag 
tctctctgta 
tcttcaatct 
atctctggtg 
ctctgctgca 
aggtcagcat 
ccgtacatct 
accgccttgc 
ggccggttct 
gcccggcctt 
ggtggtccgg 
tctgttggtt 



tgggtggtaa 
gtataacaca 
ctacggttac 
tcttggcctc 
gaaaagcgta 
agtgaagccg 
ccaaattcac 
attaaatatt 
attgttccaa 
ggtctacccc 
tcctgactgc 
caaggtagcc 
accgtgggcc 
aacttcctca 
ctctgtagcc 
cacctgctct 
ctctctgtag 
tggtcagggc 
agctctgcac 
cactcaccat 
tctctttgat 
ccagtccgtg 
gctcctgcag 
tactcatcgc 
gcgcggtagc 
cgttttggga 



gggaagccct 
gcactacata 
aattttgaaa 
tcccatgtct 
caaaagatac 
tgggccctcc 
ttctgccttc 
tttatataaa 
gtctaaatgt 
cagccaccag 
tctgcttaaa 
agtgcaagac 
cctcaggggt 
aagaactgct 
atcctcttcc 
ttctcttctg 
ccagcggcat 
actagataat 
cttagcagtg 
gtcaaactgg 
cttgccttgt 
ttcccgctgt 
gtacctcagg 
ctttcccgcc 
gggctcgtgg 
tcaccttgat 



cccggttccc 
ttggaaattt 
tattaactga 
cagtgttgcc 
ttaaaagggc 
aaatgctcgt 
ctcaggtttg 
attaaattat 
taacctcaag 
ccctcatcct 
ggtgaaagta 
atctcactct 
ctgggagtgt 
tcttgctctt 
tcttcctcca 
gagtcacatg 
tccttcttta 
tccttgagct 
agggccacga 
tcctgatccg 
tgcgccagct 
aggttcccga 
aggatcccgg 
acccaactca 
aaaaccccgg 



acaggctatg 
tttatttttc 
gcctcaaaat 
tgcatttctc 
tcctggggta 
tttatagcaa 
atatctggca 
aataaatatt 
ctactgcaat 
ctctacccag 
gcaggaacaa 
tctgacatcc 
gacgttgtaa 
ggggtatcct 
ctccttacag 
attggtagct 
actcctggat 
cagcctccat 
ttttgccatc 
caaaatagat 
gctccagcgt 
acacatcctg 
cggctcccgc 
gaaagccgga 
tctgttgtgc 



atgctgcatg 
taaataccaa 
caccctttct 
ccaggacttg 
cacaagccca 
cctctctcta 
ggtttgacta 
gccaaatgct 
ttagacaatg 
tgctctggtt 
caacaaaagc 
tgcagtcccc 
tcttcatccg 
tcaagtattg 
tacttctgcc 
gctttaatgt 
ttctttctgc 
gtagcggcag 
taggacttga 
cttctgcttg 
cttcaccacc 
ggagctgtag 
cgcagcttct 
cgttgtagtt 
tgatgcctgg 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2970 



<210> 5 

<211> 1671 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 254749.3 



<220> 

<221> unsure 
<222> 16 

<223> a, t, c, g, or other 



<400> 5 

ctggtcgtcg tctgcngcgg ctgcggcggc 
gcccggtccg catttccgcc ttccggcttc 
cgccccagcc ctcctgatcc ctcgcagccc 
tgatgatgat ggcgctgagc aagaccttcg 
acgacggcga gttctacatg atcggctccg 
gttctctgta caagagatac ccctcactct 
agaaaatagt tgcatcgtca catggtaaaa 
acacgactct agccaccagt gtgaccctgt 
atggcaacga tgagaagtac aaggctgtgt 
gggaacagaa ggccaagagg aacagccagt 
acttagatgc cgtgccatgc tccacaacca 
agagaacctt ccccctttgc tttgatgacc 
ctcagcccga ggtgctggtc cccatccggc 
gagacgcctt cacctggaac atgaatgaga 
tcctctgtga cgatctggat ttgaacccgc 
tcagacagca gatcgagtcc taccccacgg 
gcgtcatcat caagctgaac atccatgtgg 
gggacatgtc agagaaggag aactcaccag 
tggggttggg cggggagttt gtcaccacca 
ggcatcagaa gacctacgcc ttcagcgaga 
ggaacacggg cgatgcggac cagtggtgcc 
tggagaagaa gatccgcgac caggacagga 
cggccccggc ctggtaacca gcccatcagc 
tgggccgcct ctcctccatc ttctggcaag 



tgaggagccc ggctgaggcg ccagtacccg 60 
ggtttccctc ggcccagcac gccccggccc 120 
ggctccggcc gcccgcctct gccgccgcaa 180 
ggcagaagcc cgtgaagttc cagctggagg 240 
aggtgggaaa ctacctccgt atgttccgag 3 00 
ggaggcgact agccactgtg gaagagagga 3 60 
aaacaaaacc taacactaag gatcacggat 420 
taaaagcctc ggaagtggaa gagattctgg 480 
ccatcagcac agagcccccc acctacctca 540 
gggtacccac cctgcccaac agctcccacc 600 
tcaacaggaa ccgcatgggc cgagacaaga 660 
atgacccagc tgtgatccat gagaacgcat 720 
tggacatgga gatcgatggg cagaagctgc 780 
agttgatgac gcctgagatg ttttcagaaa 840 
tgacgtttgt gccagccatc gcctctgcca 900 
acagcatcct ggaggaccag tcagaccagc 960 
gaaacatttc cctggtggac cagtttgagt 1020 
agaagtttgc cctgaagctg tgctcggagc 1080 
tcgcatacag catccgggga cagctgagct 1140 
accctctgcc cacagtggag attgccatcc 1200 
cactgctgga gactctgaca gacgctgaga 1260 
acacgaggcg gatgaggcgt cttgccaaca 1320 
acacggctcc cacggagcat ctcagaagat 1380 
gacagaggcg aggggacagc ccagcgccat 1440 
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cctgaggatc gggtgggggt ggagtggggg cttccaggtg gcccttcccg gcacacattc 1500 
catttgttga gccccagtcc tgccccccac cccaccctcc ctacccctcc ccagtctctg 1560 
gggtcaggaa gaaaccttat tttaggttgt gttttgtttt tgtataggag ccccaggcag 1620 
ggctagtaac agtttttaaa taaaaggcaa caggtcatgt tcaatttctt c ' \S11 

<210> 6 

<211> 635 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 411448 

<220> 

<221> unsure 

<222> 552, 554, 557-558 

<223> unknown or other 

<400> 6 

Pro Pro Thr Pro Arg Gly His Ser Pro Pro Pro Thr Thr Gin Asp 
1 5 10 15 

Arg Gly Cys Pro Ala Ser Arg Ala Pro Gly Ser Thr Arg Pro Pro 
20 25 3 0 

Arg Ser Thr Trp Gly Ser He Thr Ser Gin Asn Ser Leu Met Asp 
35 40 45 

Lys He Thr Arg Ser Arg Glu Ala Ala Thr Ser Asn Tyr Ser Gin 
50 55 60 

Gly Asn Val Asn Arg Pro Pro Arg Pro Val Pro Val Ala Asn Tyr 
65 70 75 

Pro Hxs Ser Pro Val Pro Gly Asn Pro Thr Pro Pro Met Thr Pro 
80 85 90 

Gly Ser Ser He Pro Pro Tyr Leu Ser Pro Ser Gin Asp Val Lys 
95 100 " 105 

Pro Pro Phe Pro Pro Asp He Lys Pro Asn Met Ser Ala Leu Pro 
HO 115 120 

Pro Pro Pro Ala Asn His Asn Asp Glu Leu Arg Leu Thr Phe Pro 
125 130 135 

Val Arg Asp Gly Val Val Leu Glu Pro Phe Arg Leu Glu His Asn 
140 145 150 

Leu Ala Val Ser Asn His Val Phe His Leu Arg Pro Thr Val His 
155 160 165 

Gin Thr Leu Met Trp Arg Ser Asp Leu Glu Leu Gin Phe Lys Cys 
170 175 180 

Tyr His Hxs Glu Asp Arg Gin Met Asn Thr Asn Trp Pro Ala Ser 
185 190 195 

Val Gin Val Ser Val Asn Ala Thr Pro Leu Thr He Glu Arg Gly 
200 205 210 

Asp Asn Lys Thr Ser His Lys Pro Leu His Leu Lys His Val Cys 
215 220 225 

Gin Pro Gly Arg Asn Thr He Gin He Thr Val Thr Ala Cys Cys 
230 235 240 

Cys Ser His Leu Phe Val Leu Gin Leu Val His Arg Pro Ser Val 
245 250 255 

Arg Ser Val Leu Gin Gly Leu Leu Lys Lys Arg Leu Leu Pro Ala 
260 265 270 

Glu His Cys He Thr Lys He Lys Arg Asn Phe Ser Ser Val Ala 
275 280 285 

Ala Ser Ser Gly Asn Thr Thr Leu Asn Gly Glu Asp Gly Val Glu 
290 295 ~ 300 

Gin Thr Ala He Lys Val Ser Leu Lys Cys Pro He Thr Phe Arg 
305 310 315 

Arg He Gin Leu Pro Ala Arg Gly His Asp Cys Lys His Val Gin 
320 325 330 

Cys Phe Asp Leu Glu Ser Tyr Leu Gin Leu Asn Cys Glu Arg Gly 
335 340 345 
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Thr 


Trp Arg Cys 


Pro 


Val 


Cys 


Asn Lys 


Thr 


Ala 


Leu 


Leu 


Glu Gly 










"t r~ r\ 

350 








355 








360 




Leu 


Glu Val 


Asp 


Gin 


Tyr 


Met 


Trp Gly 


He 


Leu 


Asn 


Ala 


He Gin 




His 






365 








370 








375 




Ser Glu 


Phe 


Glu 


Glu 


Val 


Thr He 


Asp 


Pro 


Thr 


Cys 


Ser Trp 










380 








385 








390 




Arg 


Pro Val 


Pro 


He 


Lys 


Ser 


Asp Leu 


His 


He 


Lys 


Asp 


Asp Pro 










395 








400 








405 




Asp 


Gly He 


Pro 


Ser 


Lys 


Arg 


Phe Lys 


Thr 


Met 


Ser 


Pro 


Ser Gin 










410 








415 








420 




Met 


He Met 


Pro 


Asn 


Val 


Met 


Glu Met 


He 


Ala 


Ala 


Leu Gly Pro 




Gly 






425 








430 








435 




Pro Ser 


Pro 


Tyr 


Pro 


Leu 


Pro Pro 


Pro 


Pro 


Gly Gly Thr Asn 




Ser 






440 








445 








450 




Asn Asp 


Tyr 


Ser 


Ser 


Gin 


Gly Asn Asn Tyr Gin Gly His Gly 






Phe Asp 




455 








460 








465 




Asn 


Phe 


Pro 


His 


Gly Asn Pro Gly Gly Thr Ser Met Asn 






Phe Met 




470 








475 








480 




Asp 


His 


Gly Pro 


Pro 


Gin Leu 


Ser 


His 


Pro 


Pro 


Asp Met 




Pro 






485 








490 








495 




Asn Asn 


Met 


Ala 


Ala 


Leu 


Glu Lys 


Pro 


Leu 


Ser 


His 


Pro Met 




Gin 






500 








505 








510 




Glu Thr 


Met 


Pro 


His 


Ala 


Gly Ser 


Ser 


Asp 


Gin 


Pro 


His Pro 




Ser 


He Gin 




515 








520 








525 




Gin 


Gly Leu 


His 


Val Pro 


His 


Pro 


Ser 


Ser 


Gin Ser 


D 


Gly 






530 








535 








540 


PI 


Pro Pro 


Leu 


His 


His 


Ser 


Gly Ala 


Pro 


Pro 


Xaa 


His 


Xaa Ser 




Gin 






545 








550 








555 




Xaa Xaa 


Arg 


Gin 


Pro 


Pro 


Gin Ala 


Ala 


Pro 


Ser 


Ser 


His Pro 


«= 


His 






560 








565 








570 




Ser Asp 


Leu 


Thr 


Phe 


Asn 


Pro Ser 


Ser 


Ala 


Leu 


Glu 


Gly Gin 


p= 








575 








580 








585 




Ala Gly Ala Gin Gly Ala Ser 


Asp Met 


Pro 


Glu 


Pro 


Ser 


Leu Asp 




Leu 






590 








595 








600 




Leu Pro 


Glu 


Leu 


Thr 


Asn 


Pro Asp 


Glu 


Leu 


Leu 


Ser 


Tyr Leu 




Asp 






605 








610 








615 




Pro Pro 


Asp 


Leu 


Pro 


Ser 


Asn Ser 


Asn 


Asp 


Asp 


Leu 


Leu Ser 










620 








625 








630 




Leu 


Phe Glu 


Asn 


Asn 
























635 



















<210> 7 

<211> 268 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 988966.17 



<400> 7 










Met 


Thr 


Glu 


Pro 


Gly Ala 


Ser 


1 








5 




Ser 


Pro 


Val 


Gly 


Ala His 
20 


Ala 


Arg 


Ala 


Arg 


Arg 


Gly Ala 
35 


Gly 


Pro 


Lys 


Ser 


Pro 


Thr Leu 
50 


Ser 


Ser 


Ser 


His 


Pro 


Ala Cys 
65 


Ala 


Asn 


Ser 


Leu 


Asp 


Pro Asp 
80 


Asp 


Ala 


Tyr 


Lys 


Glu 


Ser Tyr 
95 


Lys 


Gin 


Ala 


Glu 


Gin 


Lys Arg 


Arg 



10 15 
Gly Arg Ala Gly Arg Ala 

25 30 
Gly Ala Ser Leu Leu Ser 

40 45 
Arg Gly Cys Arg Glu Asp 

55 60 
Glu Tyr Ala Tyr Ser Asp 

70 75 
Ser Asp Tyr His Gin Glu 

85 90 
Arg Arg Arg Ala His Thr 
100 105 
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110 






Asp 


Leu 


Gin Thr 


He 
125 


Val 


Pro 


He 


Gly 


Ser Gin 


Lys 
140 


Leu 


Ser 


He 


Asp 


Tyr He 


Gin 
155 


Phe 


Leu 


Glu 


Glu 


Val Ser 


Thr 
170 


Leu 


Arg 


Met 


Lys 


Val Asn 


Tyr 
185 


Glu 


Gin 


Pro 


His 


Glu Gly 


Glu 
200 


Asp 


Gin 


Val 


Phe 


Gin Gly 


He 
215 


Met 


Asp 


Ser 


He 


Ser Val 


Ala 
230 


Ser 


Phe 


Ser 


Trp 


He Glu 


Glu 
245 


His 


Cys 


Val 


He 


Gly Val 


Leu 
260 


His 


Gin 







115 










120 


Thr 


Cys 


Gin 


Gin 


Gin 


Asp 


Phe 


Ser 






130 










135 


Lys 


Ala 


He 


Val 


Leu 


Gin 


Lys 


Thr 






145 










150 


His 


Lys 


Glu 


Lys 


Lys 


Lys 


Gin 


Glu 






160 










165 


Lys 


Asp Val 


Thr 


Ala 


Leu 


Lys 


He 






175 










180 


He 


Val 


Lys 


Ala 


His 


Gin Asp Asn 






190 










195 


Val 


Ser 


Asp 


Gin 


Val 


Lys 


Phe 


Asn 






205 










210 


Ser 


Leu 


Phe 


Gin 


Ser 


Phe 


Asn 


Ala 






220 










225 


Gin 


Glu 


Leu 


Ser 


Ala 


Cys 


Val 


Phe 






235 










240 


Lys 


Pro 


Gin 


Thr 


Leu 


Arg 


Glu 


He 






250 










255 


Leu 


Lys 


Asn 


Gin 


Leu 


Tyr 










265 
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